The Yahoo Query Treebank, V. 1.0
Abstract
This dataset release accompanies Pinter et al. (2016), which describes the motivation and grammatical theory. Please cite that paper when referencing the dataset. The dataset is available from the Yahoo Webscope homepage, under Linguistic Data, as dataset L-28. The description in Section 2 is also included in the dataset as a Readme. The dataset is sure to contain annotation errors beyond the special cases specified in this document. Please contact the first author with any corrections, which will appear in the next release. See Section 4 for known errors.
Similar resources
Netgraph Query Language for the Prague Dependency Treebank 2.0
We study the annotation of the Prague Dependency Treebank 2.0 (PDT 2.0) and assemble a list of requirements on a query language that would allow searching for and studying all linguistic phenomena annotated in the treebank. We propose an extension to the query language of an existing search tool Netgraph 1.0 and show that the extended query language satisfies the list of requirements. We demons...
Does Netgraph Fit Prague Dependency Treebank?
On many examples we present a query language of Netgraph – a fully graphical tool for searching in the Prague Dependency Treebank 2.0. To demonstrate that the query language fits the treebank well, we study an annotation manual for the most complex layer of the treebank – the tectogrammatical layer – and show that linguistic phenomena annotated on the layer can be searched for using the query l...
Searching in the Penn Discourse Treebank Using the PML-Tree Query
The PML-Tree Query is a general, powerful and user-friendly system for querying richly linguistically annotated treebanks. The present paper shows how the PML-Tree Query can be used for searching for discourse relations in the Penn Discourse Treebank 2.0 mapped onto the syntactic annotation of the Penn Treebank.
Towards a Simple and Full-Featured Treebank Query Language
Netgraph query language is a query system for linguistically annotated treebanks that aims to be sufficiently powerful for linguistic needs and yet simple enough for not requiring any programming or mathematical skill from its users. We provide an introduction to the system along with a set of examples how to search for some frequent linguistic phenomena. We also offer a comparison to the query...
Active Learning for Building a Corpus of Questions for Parsing
This paper describes how we built a dependency Treebank for questions. The questions for the Treebank were drawn from questions from the TREC 10 QA task and from Yahoo! Answers. Among the uses for the corpus is to train a dependency parser achieving good accuracy on parsing questions without hurting its overall accuracy. We also explore active learning techniques to determine the suitable size ...
Journal: CoRR
Volume: abs/1605.02945
Publication date: 2016